Single Network Adaptive Critic for Vibration Isolation Control ?

نویسندگان

  • Jia Ma
  • Tao Yang
  • Zeng-Guang Hou
  • Min Tan
چکیده

Vibration isolation control is the critical issue to guarantee the performance of various vibration-sensitive instruments and sensors in practical engineering systems. In this paper, single network adaptive critic (SNAC) based controllers are developed for vibration isolation applications. The SNAC approach differs from the typical action-critic dual network structure in adaptive critic designs (ACDs) by eliminating the action network, which leads to substantial computational savings. Two training methods, i.e., the off-line and online methods are proposed to adapt the SNAC controllers respectively. In contrast with the existing offline SNAC training method, the off-line method proposed in this paper adopts the least mean square (LMS) training algorithm with variable learning rate to make the training procedure converge faster. Furthermore, for real-time control purpose, the online learning method is presented for tuning the weights of the critic networks along the real-time state trajectories of the isolation system. Additionally, the “shadow critic” training strategy used in the online method further improves the convergence rate. Simulation results have shown that the developed SNAC controllers using the different training methods can converge to the continuous-time optimal control solution at satisfactory speed. Moreover, the designed SNAC controllers alleviate vibration disturbance more effectively and have better control performance in comparison with the passive isolator.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Midcourse guidance law with neural networks

A dual neural network ‘adaptive critic approach’ is used in this study to generate midcourse guidance commands for a missile to reach a predicted impact point while maximizing its final velocity. The adaptive critic approach is based on approximate dynamic programming. The first network, called a ‘critic’, network, outputs the Lagrangian multipliers arising in an optimal control formulation whi...

متن کامل

On Adaptive Critic Architectures in Feedback Control

Two feedback control systems are designed that employ the adaptive critic architecture, which consists of two neural networks, one of which (the critic) tunes the other. The first application is a deadzone compensator, where it is shown that the adaptive critic structure is a natural consequence of the mathematical problem of inversion of an unknown function. In this situation the adaptive crit...

متن کامل

Continuous-Time Single Network Adaptive Critic for Regulator Design of Nonlinear Control Affine Systems

An optimal control law for a general nonlinear system can be obtained by solving Hamilton-Jacobi-Bellman equation. However, it is difficult to obtain an analytical solution of this equation even for a moderately complex system. In this paper, we propose a continuoustime single network adaptive critic scheme for nonlinear control affine systems where the optimal cost-to-go function is approximat...

متن کامل

A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems

Even though dynamic programming offers an optimal control solution in a state feedback form, the method is overwhelmed by computational and storage requirements. Approximate dynamic programming implemented with an Adaptive Critic (AC) neural network structure has evolved as a powerful alternative technique that obviates the need for excessive computations and storage requirements in solving opt...

متن کامل

Adaptive PID Controller based on Reinforcement Learning for Wind Turbine Control

A self tuning PID control strategy using reinforcement learning is proposed in this paper to deal with the control of wind energy conversion systems (WECS). Actor-Critic learning is used to tune PID parameters in an adaptive way by taking advantage of the model-free and on-line learning properties of reinforcement learning effectively. In order to reduce the demand of storage space and to impro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008